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For some time the discrete strategy improvement algorithm due to Jurdzinski and Voge had been 
considered as a candidate for solving parity games in polynomial time. However, it has recently been 
proved by Oliver Friedmann that the strategy improvement algorithm requires super-polynomially 
many iteration steps, for all popular local improvements rules, including switch-all (also with Fearn- 
ley's snare memorisation), switch-best, random-facet, random-edge, switch-half, least-recently-con- 
sidered, and Zadeh's Pivoting rule. 

We analyse the examples provided by Friedmann in terms of complexity measures for directed 
graphs such as treewidth, DAG-width, Kelly- width, entanglement, directed pathwidth, and cliquewidth. 
It is known that for every class of parity games on which one of these parameters is bounded, the 
winning regions can be efficiently computed. It turns out that with respect to almost all of these 
measures, the complexity of Friedmann's counterexamples is bounded, and indeed in most cases by 
very small numbers. This analysis strengthens in some sense Friedmann's results and shows that the 
discrete strategy improvement algorithm is even more limited than one might have thought. Not only 
does it require super-polynomial running time in the general case, where the problem of polynomial- 
time solvability is open, it even has super-polynomial lower time bounds on natural classes of parity 
games on which efficient algorithms are known. 

1 Introduction 

Parity games are a family of infinite two-player games on directed graphs. They are important for several 
reasons. Many classes of games arising in practical applications admit reductions to parity games (over 
larger game graphs). This is the case for games modelling reactive systems, with winning conditions 
specified in some temporal logic or in monadic second-order logic over infinite paths (SIS), for Muller 
games, but also for games with partial information appearing in the synthesis of distributed controllers. 
Further, parity games arise as the model-checking games for fixed-point logics such as the modal /u- 
calculus or LFP, the extension of first-order logic by least and greatest fixed-points. Conversely, winning 
regions of parity games (with a bounded number of priorities) are definable in both LFP and the p- 
calculus. Parity games are also of crucial importance in the analysis of structural properties of fixed-point 
logics. 

From an algorithmic point of view parity games are highly intriguing as well. It is an immediate 
consequence of the positional detenninacy of parity games, that their winning regions can be decided in 
NP n Co-NP. In fact, it was proved in ifTTTl that the problem is in UP n Co-UP, where UP denotes the 
class of NP-problems with unique witnesses. The best known deterministic algorithm has complexity 
n O( yIn) |Tjjj For parity games with a number d of priorities the progress measure lifting algorithm by 
Jurdzinski [12] computes winning regions in time 0{dm - (2n/(d/2)) d / 2 ) = 0(n d ^ 2+0 ^), where m is the 
number of edges, giving a polynomial-time algorithm when d is bounded. The two approaches can be 
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combined to achieve a worst-case running time of 0(n d/3+0 ^) for solving parity games with d priorities, 
with d = yn (see [1 , Chapter 3]. 

Although the question whether parity games are in general solvable in Ptime is still open, there 
are efficient algorithms that solve parity games in special cases, where the structural complexity of the 
underlying directed graphs, measured by numerical graph parameters, is low. These include parity games 
of bounded treewidth lfT51 . bounded entanglement (3JIH, bounded DAG- width O, bounded Kelly- width 
iflOl . or bounded cliquewidth ifToll . 

One algorithm that, for a long time, had been considered as a candidate for solving parity games 
in polynomial time is the discrete strategy improvement algorithm by Jurdzihski and Voge iTffl . The 
basic idea behind the algorithm is to take an arbitrary initial strategy for Player and improve it step by 
step until an optimal strategy is found. The algorithm is parametrized by an improvement rule. Indeed, 
there are many possibilities to improve the current strategy at any iteration step, and the improvement 
rule determines the choice that is made. Popular improvement rules are switch-all, switch-best, random- 
facet, random-edge, switch-half and Zadeh's Pivoting rule. Although it is open whether there is an im- 
provement rule that results in a polynomial worst-case runtime of the strategy improvement algorithm, 
Friedmann |9] was able to show that there are super-polynomial lower bounds for all popular improve- 
ment rules mentioned above. For each of these rules, Friedmann constructed a family of parity games on 
which the strategy improvement algorithm requires super-polynomial running time. 

In this paper we analyse the examples provided by Friedmann in terms of complexity measures for 
directed graphs. It turns out that with respect to most of these measures, the complexity of Friedmann's 
counterexamples is bounded, and indeed in most cases by very small numbers. This analysis strengthens 
in some sense Friedmann's results and shows that the discrete strategy improvement algorithm is even 
more limited than one might have thought. Not only does it require super-polynomial running time in 
the general case, where the problem of polynomial-time solvability is open, it even has super-polynomial 
lower time bounds on natural classes of parity games on which efficient algorithms are known. 

2 The strategy improvement algorithm 

We assume that the reader is familiar with basic notions and terminology on parity games. We shall 
now briefly discuss the discrete strategy improvement algorithm and the different improvement rules that 
parametrize it. For the purpose of this paper a precise understanding of the algorithm is not needed. The 
idea of the strategy improvement algorithm is that one can compute an optimal strategy of a player by 
starting with an arbitrary intitial strategy and improve it step by step, depending on a discrete valuation 
of plays and strategies, and on a rule that governs the choices of local changes (switches) of the current 
strategy. 

It is well-known that parity games are determined by positional strategies, i.e. strategies which at 
each position just select one of the outgoing edges, independent of the history of a play. The discrete 
valuation defined by Jurdzihski and Voge lfl4l measures how good a play is for Player in a more 
refined way than just winning or losing. Given a current strategy one can then, at each position of 
Player 0, consider the possible local changes, i.e. the switches of the outgoing edges, and select a locally 
best posibility. Rules that describe how to combine such switches in one improvement step are called 
improvement rules and parametrise the algorithm. 

The switch-all or locally optimizing rule [ 14 ] regards each vertex independently and performs the 
best possible switch for every vertex. In other words, for every vertex, it computes the best improvement 
of the strategy at that vertex assuming that the strategy remains unchanged at other vertices. However, the 
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switch is done simultaneously at each vertex. The switch-best or the globally optimising rule takes cross- 
effects of improving switches into account and applies in every iteration step a best possible combination 
of switches. 

The random-edge rule applies a single improving switch at some vertex chosen randomly and the 
improvement rule switch-half applies an improving switch at every vertex with probability 1/2. The 
random-facet rule chooses randomly an edge e leaving a Player vertex and computes recursively a 
winning strategy cr* on the graph without e. If taking e is not an improvement, cr* is optimal, otherwise cr* 
switched to e is the new initial strategy. The least-entered rule switches at a vertex at that the least 
number of switches has been performed so far. Cunningham's least-recently-considered or round-robin 
rule fixes an initial ordering on all Player vertices first, and then selects the next vertex to switch at in a 
round-robin manner. Fearnley introduced in [6] snare memorisation. It can be seen as an extension of a 
basic improvement rule by a snare rule that memorises certain structures of a game to avoid reoccurring 
patterns. 

All local improvement rules discussed here can be computed in polynomial time lfl4l [T71 . Hence 
the running time of the algorithm on a game depends primarily on the number of improvement steps. In 
a series of papers and in his dissertation, Friedmann has constructed, for each of the above mentioned 
improvement rules, a class of parity games on which the strategy improvement algorithm requires super- 
polynomially many iteration steps, with respect to the size of the game [9, 8,7]. We shall analyse these 
games in terms of certain complexity measures for directed graphs which we describe in the next section. 

3 Complexity measures for directed graphs 

For most of the complexity measures we shall work with a characterisation in terms of so-called graph 
searching games, that allow us a more intuitive point of view on the measures and give us an easier way 
to analyse the graphs in question. A graph searching game is played on a graph by a team of cops and a 
robber. In any position, the robber is on a vertex of the graph and each cop either also occupies a vertex 
or is outside of the graph. The robber can move between vertices along cop-free paths in the graph, i.e. 
paths whose vertices are not occupied by cops. The moves of the cops have typically no restrictions. 
The aim of the cops is to capture the robber, i.e. to force him in a position where he has no legal moves. 
Precise rules of moves characterise a complexity measure of the graph. The value of the measure is the 
minimal number of cops needed to capture the robber ( - 1 in some cases). Hence, on simple graphs, few 
cops suffice to win wheras complex graphs demand many cops to capture the robber. 

In that way, treewidth, DAG-width, Kelly-width, directed pathwidth and entanglement can be de- 
scribed. Another measure that we shall consider is cliquewidth, for which no characterisation by games 
is known. Recall that common definitions of such measures are usually given by means of appropriate 
decompositions of the graph into small parts that are connected in a simple way: as a directed path for 
directed pathwidth, as a tree for treewidth, as a DAG for DAG-width and Kelly-width, or as a parse tree 
for cliquewidth. The maximal size of a part in a decomposition corresponds to the minimal number of 
cops needed to capture the robber on the graph (except for entanglement, for which no corresponding 
decomposition is known). Such decompositions can be used to provide efficient algorithms for problems 
that are difficult (e.g. NP-complete) in general, on graph classes where the values of the respective mea- 
sure is bounded. In particular, this is the case for parity games. In a series of papers it has been shown 
that parity games can be solved in Ptime on graph classes with bounded treewidth, directed pathwidth, 
DAG-width, Kelly-width, entanglement or cliquewidth. In the following, we define all complexity mea- 
sures discussed above by their characterisations in terms of graph searching games except cliquewidth, 
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for which we give an inductive definition. 

Treewidth. Treewidth is a classical measure of cyclicity on undirected graphs. It measures how close 
a graph is to being a tree. The treewidth game twG/t(£r) is played on an undirected graph Q = (V,E) by 
a team of k cops and a robber, whereby k is a parameter of the game. Initially, there are no cops on the 
graph and the robber chooses an arbitrary vertex and occupies it in the first move. The players move 
alternating. A cop position is a tuple (C,v) where C QV with \C\ < & is the set of vertices occupied by 
cops (if k > \C\, the remaining cops are considered to be outside of Q) and v g C is the vertex occupied by 
the robber. The cops can move to a position (C,C',v) with |C'| < k. Intuitively, they announce their next 
placement C and take cops from C \ C away from Q. The robber positions are of the form (C,C',v). 
The robber can run along paths on the graph whose vertices are not occupied by cops, i.e. the next (cop) 
position may be (C',w) where w e Reach^_(cnc')( v ) \ C> ie - w * s reachable from v in Q - (C D C). Thus 
the cops are placed on the vertices they announced in their previous move; furthermore, only those cops 
prevent the robber to run who are both in the previous and in the next placements. However, the robber 
is not permitted to go to a vertex which will be occupied by a cop in the next position. 

The robber is captured in a position (C, C',v) if he has no legal move: all neighbours of v are in C n C 
and a cop is about to occupy his vertex, i.e. v e C. A play is monotone if the robber can never reach a 
vertex that has already been unavailable for him. It suffices to demand that in any move, the robber is 
not able to reach a vertex that has just been left by a cop. Formally, a play is monotone if, for every cop 
move (C,v) -> (C,C',v) in the play, we have Reach£_ (CnC ')(v) n (C\ C) = 0. 

The cops win a monotone play if it ends in a position in that the robber is captured. Infinite or non- 
monotone plays are won by the robber. A (positional) strategy for the cops is a function cr: 2 V x V — > 2 V 
which prescribes, for every cop position (C,v), the next placement cr(C,v). Similarly, a (positional) 
strategy for the robber is a function p: 2 V x 2 V — > V which maps a robber position (C,C',v) to a cop 
position (C',w) with w e Reach^-(cnc')( v )- A play n is consistent with a strategy cr (or p) if every move 
in the play is made according to cr (or p). A strategy for a player is winning if he wins every play 
consistent with that strategy. As the winning conditions for both players are Boolean combinations of 
reachability and safety, it suffices to consider only positional strategies. 

The minimal number k such that the cops have a winning strategy in twG*+i(^) is the treewidth 
tw(@) of Q. If Q = (V,E) is a directed graph then tw(^) is tw(^) where Q = (V,E) and E is the symmetric 
closure of E. 

DAG-width. The DAG-width game dagwG )t (^) is played on a directed graph Q in the same way 
as the treewidth game, but the edge relation of the graph is not symmetrised. Note that the meaning of 
the reachability relation Reach on directed graphs is, of course, different from the reachability relation 
on undirected graphs. In a DAG-width game, the robber is allowed to run only along directed paths. The 
DAG-width dagw(^) of a graph Q is the minimal number k such that the cops have a winning strategy 
in the game dagwG (t (^). Note the difference to treewidth where the parameter of the game is defined by 
k + 1 in order to make forests have treewidth 1 . 

Kelly-width. The Kelly-width game KwG,t(^) is played on a directed graph Q in the same way as 
the DAG-width game, but the robber is, first, invisible for the cops and, second, inert ifTOl . Invisibility 
means that a winning strategy for the cops must not depend on the robber vertex and the cops can make 
assumptions about it only from their own moves. Inertness of the robber means that the robber can 
change his vertex only if a cop has announced to occupy the robber vertex in the next position. Formally, 
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a cop position is a tuple {C,R) where C is as before and R c V is disjoint with C. The cops can move to 
a robber position (C,C',R). The moves of the robber are determined by the current position, so, in fact, 
we have a one-player game: the next position is (C ,R') where R' = (R U Reach^_(cnc')(^ n C)) \ C". 
The term Reach (/? n C) describes the inertness of the robber and the term RU... means that the robber 
may still be on a previous vertex if no cop is about to occupy it. Kelly-width is defined analogously to 
DAG-width. 

Directed pathwidth. The directed pathwidth game is played as the Kelly-width game, but the rob- 
ber is not inert. Formally, the position following a robber position (C,C',R) is (C',R r ) where R' = 
Reach£_(cnc)(K n C) \ C Similar to treewidth, directed pathwidth dpw(^) of Q is the minimal num- 
ber k such that the cops have a winning strategy in dpw )t+1 (^). 

Entanglement. The entanglement game entGkiQ) GO is slightly different from the games defined 
above. First, the robber can move only along an edge rather than along a whole path. Second, he is 
obliged to leave his vertex, no matter if a cop is about to occupy it or not (thus no cops are needed on 
an acyclic graph). Third, the cops are restricted in their moves as well. In a cop position (C,v), one cop 
can go to the v, other cops must remain on their vertices. Another possibility for the cops is to stay idle. 
More formally, cop positions are of the form (C,v) and the cops can move to some position (C",v) where 
C = C, or C = C U {v} (if a new cop comes in to the graph), or C = (C U {v}) \ {w} where w e C is distinct 
from v. From a position (C',v), the robber can move to a position (C',v') where (v,v') € E and v' £ C". 
Unlike all games above, in the entanglement game, the cops do not need to play monotonically, so they 
win all finite plays and the robber wins all infinite plays. Entanglement ent(^) of a graph Q is the minimal 
number k such that the cops have a winning strategy in tntGkiQ). 

Cliquewidth was introduced in Q. Let C be a finite set of labels. A C-labelled graph is a tuple Q = 
(V, E, y) where y : V — > C is a map that labels the vertices of Q with colours from C. An a-port is a vertex 
with colour a. Let k be a positive natural number and let |C| < k. The class Ck of graphs of cliquewidth 
at most k is defined inductively by the following operations. 

(1) For every a € C, a single a-port without edges is in 

(2) If Q x - (Vi.^i.yi) and Q 2 - (^,£2,72) are in C k then the disjoint union Q x ®Q 2 - (V,E,y) of Q x 
and Qi is in Ck where V = V\ V2, E = E\ E2, and y(v) = 71 (v) if v e Vi and y(v) = 72OO if v e V2- 

(3) If Q = (V,E,y) is in Ck then the graph Q' obtained by recolouring every a-port to a ft-port is in Ck, 
i.e. Q' = (V,E,y') where y'(v) = y'(v) if y(v) + a and y'(v) = b otherwise. 

(4) If Q = (V,E,y) is in Ck then the graph Q' obtained by connecting all a-ports to all ^-ports is in Ck, 
i.e. §' = (V,E',y) where E' = EU {(v,w) \ y(v) = a and y(w) = b\. 

The cliquewidth cw(^) of a graph Q - (V,E,y) is the least k such that the graph (V,E) is in Ck- 
The following theorem is a combination of results proved in ll2l[3ll4l [T0l[T5l[T6l . 

Theorem 1. Let C be any class of finite graphs on which at least one of the following measures is 
bounded: treewidth, directed pathwidth, DAG-width, cliquewidth, Kelly-width, entanglement. Then the 
winning regions of parity games on C are computable in polynomial time. 

It follows directly from the definitions that DAG-width and Kelly-width are bounded in directed 
pathwidth. 

Theorem 2. For a graph Q, we have dagw(^) < dpw(^) + 1 and Kw(^) < dpw(^) + 1. 
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4 Friedmann's counterexamples 

We now describe and analyze the graphs that underlie Friedmann's counterexample games. For most 
of the rules, these graphs have a rather similar structure, and for reasons of space we give a detailed 
pesentation just for the examples for the switch-all rule and Zadeh's Pivoting rule. Our analysis of the 
examples for the other rules will be summarized in a table, proofs will be given in the full version of this 
paper. 

4.1 The Switch-All Rule 

For n € N\ {0}, the graph Q n - (V n ,E n ) underlying Friedmann's games against the switch-all rule can 
defined as follows. The set of vertices is 

V n := {x,s,c,r} U{f,-,a,- : 1 < i< 2n} U{di,ei,gi,ki,f,hi : 1 < i <n}. 

The set of edges and the graph £73 are given in Figured] The graph Q n consists of cycle gadgets induced 
by \di,ei\ each encoding a bit which is considered to be set if the current strategy of Player is to move 
from dj to e,- and unset otherwise. Intuitively, the strategy improvement algorithm with the switch-all 
rule starts from the state where all bits are unset and increases the bit counter by one in each round. The 
subgraph induced by all hj, kj, gj, and fj, for j < n guarantees the algorithm to swap the least significant 
bit and the subgraph induced by ay and tj, for j < 2n ensures that the other bits to change are swapped as 
well, see (9l for details. 

Friedmann showed in (9l that, for every n > 0, there is a parity game of size 0(n 2 ) with underly- 
ing graph Q n such that the strategy improvement algorithm with the switch-all rule requires at least 2" 
improvement steps on that game. 

We shall now establish upper bounds for DAG-width, Kelly-width, directed pathwidth, entanglement, 
and cliquewidth of the graphs Q n , which imply by TheoremQ] that Friedmann's games belong to natural 
classes of parity games that can be solved efficiently by other approaches than the strategy improvement 
algorithm. We start with an analysis of treewidth of Q n and show that it is unbounded on the class of 
graphs Q n . Recall that treewidth of a directed graph Q = (V,E) is defined by the treewidth of Q = (V,E) 
where E is the symmetrical closure of E. The reason for treewidth to be unbounded is that it contains 
arbitrarily large complete bipartite graphs 7C„^ as subgraphs, whereby tw(7<"„ „) = n. Indeed, every vertex 
has n direct successors, so if the robber is caught staying on a vertex v, all successors of v and v itself 
must be occupied by cops. The following lemma shows that we can find an arbitrary complete bipartite 
graph as a subgraph of a graph of the family Q n . 

Lemma 3. For every k > 0, there is some n > such that Q n has 'K^k o,s a subgraph. 

Proof. Choose n .- |"|] +k- 1. The vertex d^k-^ is the first of the vertices d\,...,d n to be connected to 

the vertices A := {aj : j < k\. The k - 1 vertices du i — T§1 + 1, . . . , Tfl +k-l are connected to each vertex 
of A as well. Neither the vertices of A are directly connected to one another, nor are the vertices of 
B := {di I i = rfl, . . ., rfl +Jfc- 1}. It follows that ~Q\A U B] is isomorphic to <K ktk . □ 

Corollary 4. For every k>0, there is some n>0 such that tw(@ n ) > k. 

Remark 5. Although the treewidth of Q n is unbounded, there is another class of graphs with bounded 
treewidth, such that the strategy improvement algorithm with switch-all rule requires super-polynomial 
time. We shall see in Section \4~3\ that for the random-edge rule, Friedmann's counterexample class has 
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Figure 1 : The graph £73 and the edge relation of Q n for the counterexample to the switch-all rule. 

bounded treewidth. In fact, that class requires super-polynomial time also for the switch-all rule, see /(£]/ 
for details. 

Now we prove that directed pathwidth of graphs Q n is bounded, which leads to boundedness of 
DAG-width and Kelly-width. 

Theorem 6. For all n > 0, we have dpw(^„) < 3. 

Proof. We describe a monotone winning strategy for 4 cops in the directed pathwidth game. First, r 
and s are occupied by two cops who will stay there until the robber is caught. In the next round, the two 
other cops expel the robber from all vertices dj, a, f, g,, hi, and ki (if he is there). For i - 1, . . . ,n, starting 
with i = 1 the cops place a cop on e,- and then visit with the last remaining cop vertices dj, f, gi, hi, and 
ki in that order. 

The robber may be on vertex x, or in the part of the graph induced by a\, t\ and c, for i e {1,. . ., 2n}. 
The cop from k n (one of those not on r or s) visits x and then a n , t n , a n -\, t n -\, ...,a\, t\ in that order 
and finally c. Obviously, the described strategy for 4 cops is monotone and guarantees that the robber is 
captured. □ 
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By Theorem[2j we get the following corollary. 

Corollary 7. For all n > 0, we have dagw(^„) < 4 and Kw(^„) < 4. 

We modify the strategy from the proof of Theorem [6] to obtain a winning strategy for 3 cops in the 
entanglement game. That is necessary, as in the latter, the cops are not permitted to be placed on a vertex 
which is not occupied by the robber. We first need a lemma from Q. 

Lemma 8. The entanglement of a graph is one if, and only if it is not acyclic and each of its strongly 
connected components contains a vertex whose removal makes the component acyclic. 

Theorem 9. For all n>0, we have ent(^„) < 3. 

Proof. Let Q r £ be the graph which is obtained from Q n by deleting vertices r and s and all adjacent 
edges, i.e. Q r jf = @ n [V n \ {r,s}]. The only strongly connected components of Q'^ where the robber can 
remain are the one induced by x and those induced by {dj,ej}. All other components are singletons 
without self-loops, so the robber can stay there only for one move. Each of the components induced 
by x or by have a vertex whose removal makes the component acyclic. By Lemma [U there is a 

strategy o~ for one cop to catch the robber on Q r ,f . Thus it suffices to prove that the cops can occupy r 
and s. They use one cop who moves according to cr until the robber is captured or visits r or s. Assume 
by the symmetry of argumentation, that the robber visits r. A second cop follows him to r and remains 
there until the end of the play. Then the first cop plays according to cr again. As r is occupied by a cop, 
the robber is either captured, or visits s. Then the last cop follows him to s. Finally, the first cop plays 
according to cr for the last time and the robber loses the play. □ 

We show that the cliquewidth of Q n is bounded as well. Graph Q n can be decomposed into n layers, 
the i-th layer is induced by vertices gi,f,ei,di,hi,kt, tn, hi-i, a2i, and an-\. We construct Q n inductively 
over i - 1, . . .,n connecting the new layer to the previous ones. Simultaneously, we connect r, and s to 
the i-th layer. Then vertex x is connected to the graph. 

Theorem 10. For all n>0, we have cw(^„) < 10. 

Proof. We consider graph Q n as consisting of layers X,-, for i e {\,...,n} where each X,- is induced by 
vertices d{, e\, f, hi, k[, g\, a2i, ct2(-i > ^2i> and ?2/-i ■ The layers are produced for i = 1,2. . . ,n by induction 
on i and connected to the previous layers. Level 1 is constructed in the same way as further layers (up 
to vertex c, which is easy to produce), so we do not describe the base case explicitly. Assume, all layers 
from Xi to X, are constructed with following labelling, which is an invariant that holds after a layer is 
constructed, see the first picture on Figure |2] (connections from ? ( - to r and s are not shown). 

• For j e {l,...,2i- 1}, all*/, and, for j e {1,. ..,/}, all dj, ej, hj, and fj have colour Done. 

• ?2; has colour T. 

• For j e {1, ... ,2i}, all ay, have colour A. 

• For j € { 1 ,...,/}, all kj have colour K, and all gj, have colour G . 

• r has colour R and s has colour S . 

We construct layer i + 1 satisfying the invariant. First, create vertex ct2r+i with colour A and vertex f2,-+i 
with colour T and connect A — > T . Then take the union of the previous layers and {a2i+i.?2(+i) and 
connect T — > 7 ', T — > R and T — > S . Relabel T — > T. Then repeat the same procedure with a% + 2 
and ?2(+2 instead of 02/+! and ?2(+i> see the second picture on Figure [2J 
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hi/T < i2i'M * dj/Done ej/Done — > hi/Done > kj/K 




hi+i/T < — a 2i+2 /A 
I 

t 2i+l /Done < aoi+i/A 



t 2 j/Done < — a 2 j/A < dj/Done ej/Done — > hi/Done > kj/K 




hi+ilT < — a 2i+2 /A < 
t 2 j+i/Done < a 2 j + i/A / . (r/R) Q/S 



di+i/D e i+ i/Done -> h M /G — > k i+ i/T' 



hi /Done < — a 2 j/A: 

\ v 
t 2 j-i/Done < a 2 j-i/A QfS 



fi+ 1 / F < — gi + \/G' 



dj/Done ej/Done > hj/Done > kj/K 



fi/Done < gj/G 



Figure 2: Construction of Q n with ten colours. 
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Figure 3: The graph Z.3 and the edge relation of Z, n for the counterexample to Zadeh's least-entered rule. 



Now we construct the subgraph C,+i induced by di+\, , hi+\, gi+i, and f i+ [ using colours D, Done, 
G, T , G' , and F. Note that colours Done and T are reused. Produce di+i, ei+\, fi+i, gj+i, ki+i, and h i+ i 
with labels D, Done, F, G' , T , and G, respectively, and connect them as needed, also to r and to s (see 
the third picture on Figured]). 

Relabel G — > Done. Build the disjoint union of Cj+i and the already constructed graph. Connect K — » 
G' (which connects all fey, for j < / + 1 to g i+ i ; dashed line in the figure), and relabel T — » /: and G' — > G. 
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Connect D — > A (which connects di+\ to all a/, for j < 2i, dotted lines in the figure) and relabel D — > Done. 
This finishes the construction of layer L !+ i. Note that the properties from the invariant hold for L !+ i. 
Finally, produce vertex x with colour T and connect all kj to ^ and x to itself. It remains to count the 
colours. We used Done, T , T , A, F, G, G' , K, R, and S , which makes ten colours. 

□ 

4.2 Zadeh's least-entered rule 

As a second example we discuss the counterexample of Friedmann against Zadeh's least-entered rule. 
The underlying game graphs are denoted Z, n - As Q n the graph Z, n can be decomposed into n layers, see 
Figure |3]for graph Z.3 and a list of edges of Z n . Vertices kj, c°, A9, b Q n , b° i0 , d®, h®, c\, A\, b l n , bj , d), 
and hj induce the i-th layer. The subgraphs induced by c®, A®, b- v b® Q , d®, and c) , At, b- v bj Q , d], hj 
are isomorphic to each other. 

A run of the strategy improvement alogrithm on Z, n simulates an «-bit counter with values from 
to 2" - 1. The difference to the switch-all rule is that the least-entered rule chooses an improving edge 
that has been switched least often. Because the lower bits of an n-bit counter are switched more often, 
the higher bits would be switched before they should in order to catch up with the lower bits. This means 
that the «-bit counter would not go through all the steps from to 2" - 1. Friedmann solved this problem 
by representing each bit i by two bits, iq and i\. The associated structures in Z. n are the gadgets induced 
by {A* ) ,/j' ) 1 ,M ) } and {Aj,bj { ,bj Q } respectively. The bit ij is considered to be set, if the current Player 

strategy chooses both edges (bj Q ,Aj) and (b 3 v Aj), and unset otherwise. In a run of the algorithm only 
one of the bits ;'o and ii is active and able to effect the rest of the counter at the time. The inactive bit 
can, in the meantime, switch back and forth from to 1 in order to catch up with the rest of the counter 
without having an effect on it. 

The counterexample contains a vertex kj in each layer i such that all ki induce an «-clique in the 
graph. This makes all values of measures that describe cyclicity (i.e., treewidth, directed pathwidth, 
DAG-width, and Kelly-width) unbounded on the class of the counterexample graphs, but cliquewidth of 
the graphs is still small. 

Theorem 11. For all n > 0, we have cw(Z. n ) ^ 9. 

Proof. The proof is very similar to the proof of Theorem [TO] We regard the graphs Z, n as consisting of 
layers X; that are induced by vertices kj, c t , A9, bj Q , b J n , dj, and hj, for i £ {1,. ..,«} and j e {0, 1}. The 
layers are constructed for i - 1,2,..., n by induction on i and connected to the previous layers. In the 
induction step, we build a new layer and connect it to the previous ones. Finally, we add the vertex s and 
the top layer, that consists of t and k n+ \, and establish the connections to the other n layers. 

As in the proof of Theorem [lOl layer Xi is constructed in the same way as further layers. Assume, 
layers from Xi to X/ have been constructed with the following labelling, which is an invariant that holds 
after a new layer is constructed. 

• For j € { 1 , . . . , i] and s, j' e {1 , 2}, all k ; have colour K, all A s . and c s have colour Done, all d s . have 
colour D, and all bj , have colour B. 

• For / € { 1 1 1 , all h° have colour H and all h 1 have colour Done. 

• h Q has colour Hi and h x . has colour H, . 

We construct the layer and connect it to the previous layers such that at the end of that process 
the invariant is true. First, produce the vertex ki+\ with colour K' . Connect the vertices h^,...,^ ^ 
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and the vertex h\ to tq+\ by connecting //-ports and H r -ports to ZT'-ports, and relabelling ///—>// and 
H r — > Done. Extend the clique consisting of the vertices &i, by connecting K — > K' and K' — > 
Thus the conncections between X,+i and the previous layers have been established. 

Next, we construct the rest of using colours C, Done, B, D, Hi and H, . Create vertices c° +1 , A? +1 , 
^+1 i' ^i+i 0' anc * ^i+i Celled with colours C, Done, B, B, D and ///, respectively, and connect them 
as needed. Repeat this procedure for vertices c) +l , A* +1 , b M { , b) +l , dj +l and A. j with the difference 
that h, + j obtains colour //,-. Build the disjoint union of these two subgraphs and the already constructed 
graph. Connect jtj+i to c? +1 and cj +1 by /f' — > C. Relabel K' — > /f and C — > Done. This finishes the 
construction of X,+i- Note that the invariant for the vertex labels is satisfied. 

After all n layers have been built, relabel ///—>// and create vertex s with colour K' (which is reused). 
Connect K' — > K and D K' . Relabel D — > Done. It remains to add vertices &„ + i and ? to the graph. 
Create k n+l and f with colours C and D. Connect H r ^C,C^D,D^D,K^D,B^K,B^D 
and /£' — > D. This produces the top layer induced by kj+i and t and establishes the edges between the 
first n layers and the vertex s and the top layer. Note that we reused the colours D, C and K' . It remains 
to count the colours. We used the nine colours Done, K, K' , C, B, D, H r , Hi and H. Hence, our claim 
holds. □ 

4.3 Other rules 

The graph complexity of the other counterexamples constructed by Friedmann is similar to the one of 
the graphs Q n . All of them can be decomposed into layers which are connected in a simple way, and they 
can be treated by the same techniques as in Theorems [6l|9] and |T0j The results are shown in Table [TJ For 
the case of snare memorisation we did not find any bound for the cliquewidth, but all the other measures 
(except treewidth) have very small values on those graphs. 



Rule / Measure 


tree 


directed path 


DAG 


Kelly 


Entanglement 


Clique 


switch-all 


oo 


3 


4 


4 


3 


10 


switch-best 


oo 


3 


4 


4 


3 


18 


random-edge 


8 


3 


4 


4 


3 


12 


random-facet 


3 


1 


2 


2 


1 


6 


least-entered 


oo 


oo 


CO 


oo 


oo 


9 


least-considered 


7 


3 


4 


4 


4 


7 


snare memory 


oo 


3 


4 


4 


4 


? 



Table 1: Upper bounds in different measures for the counterexample graph classes. 
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